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(57) Abstract: A device, system and methods of data management are disclosed, which facilitate the implementation of improved 
mirroring, back-up, volume remapping, extent relocation, prefetching, caching, data reformatting, statistic gathering, and data trans- 
lation, among others. A new, intelligent I/O stream splitter is disclosed that may intercept and alter an I/O stream received by the 
splitter from a communications link. For example, in the case of mirroring, the intelligent splitter may intercept write commands 
and associated data from a mainframe that target a specific storage location on a specific control unit The splitter may then transmit 
the intercepted I/O stream to the targeted control unit and storage location over one link and in parallel transmit on another link an 
altered version of the intercepted I/O stream to another control unit, which is responsible for holding a mirrored version of the data. 
The altered version could have the same data as that on the one link but have new frame headers injected at appropriate points in 
the I/O stream. Moreover, the intelligent splitter may improve the functionality of extended functions such as mirroring as well. 
The alterations to the I/O stream to the mirroring control unit may include changes to the control information of the I/O stream, 
for example, to address the mirroring control instead of the primary unit. But the alterations may also include changes to the data 
itself, including formatting, filtering and translating. A host may be connected to potentially several intelligent splitters to implement 
extended functions cooperatively. Among other things, the host may be used to configure the splitter to perform the mirroring (or 
other) operation at a physical level of addressing, though the host may receive requests expressed at a logical level. The host may 
then map the logical name to physical addresses and configure the splitter accordingly. The host may also instruct the sputter to 
create and send streams independently of any I/O streams passing through the splitter. This created I/O stream may correspond to 
an entire CCW in ESCON contexts. 
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INTELLIGENT SPLITTER, SYSTEM. AND METHODS OF USE 

Background Of The Invention 

The present invention relates to data management. More specifically, the present 
invention relates to methods and apparatuses lor managing large amounts of data, for example, ui 
storage area networks and mainframe I/O environments. 

The demands for managing large amounts of data have steadily increased in recent years 
and arc expected to continue to increase in the future. For example, large organizations such as 
airlines and financial institutions require continuous, reliable, around-the-clock access to their 
"mission critical" data. Temporary interruptions in the accessibility to this data, or the loss of 
pontons of this data, can be catastrophic to such organizations. Complicating the management 
task, many organizations have an enormous and growing amount of mission critical data (e.g., 
many terabytes). Much of this data is managed by mainframe-based computer systems. 

Figure 1 shows a block diagram of an exemplary prior an computing system 100, which 
is described here to illustrate common management lasks and associated problems. System 100 
includes two mainframe computers 1 1 OA, 1 10B, three direct access storage devices (DASDs, 
also known as ' control units") 120A, 120B, 120C, and a director 150. The various components 
communicate with one another via ' point-iorpoint" communication links 160, 162, 164, 166, 
168, 170, and 172 according to a defined protocol. The common protocol is the ESCON 
protocol, also known as the SBCON protocol (hereinafter, collectively called "ESCON"). 

In this exemplary system, port 1 14A of computer 1 10A is coupled to port 122B of control 
unit 120B via link 162. Pon 1 12B of computer 1 1 0B is coupled to port 124B of control unit 
120B via link 164. Port 1 14B of computer 1 10B is coupled to a port (not shown) of director 150 
via link 166. Port 126B of control unit 120B is coupled to another port (not shown) of director 
150 via link 168. Port I24C of control unit 120C Is coupled to a port (not shown) of director 150 
via link 170. In each case, the physical link and protocol are ESCON compliant. Finally, pon 
PI of control unit 1 20 A is coupled to pon P2 of control unit 120B via a proprietary 
communication link 1 72, in which the physical link is ESCON but which is used to carry 
proprietary commands and data, for example, to implement certain "extended functions" (more 
below). For convenience of illustration, each computer I I0A, ! 10B is illustrated as including 
two ESCON ports 1 12, 1 14. 
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In the illustrated embodiment; ihC'ttiainlramc computers arc IBM S/39(>$. Exemplary 
IBM S/390 mainframe computer may include between sixteen to 256 ESCON communication 
pons. 

Each exemplary DASD control unit 120, for case oi description, is shown as including 
three ESCON communication ports 122, 124, 126, and optionally additional private links' PI, P2. 
though a typical DASD control unit may include between 2 and 64 ESCON pons. The 
illustrative DASD control units 120 include a main memory 130, a controller 132; a persistent 
storage 134, and three memory buffers 123, 125, 127, each buffer being associated with a 
corresponding ESCON port. Each pon can write data into, and read data out of, its associated 
buffer. The controller 1 32 can write data into, and read data out of, all of the buffers or move 
data to or from persistent storage 134 via an internal system bus 136. 

A director 150 improves connectivity in a storage network by allowing one mainframe 
computer pon to connect to two or more control units. 

As mentioned above, the various components may communicate using the ESCON 
protocol. Under ESCON, the components communicate according to "chains" of one or more 
channel command words (CCWs). Each.CCW, in turn, is communicated in three phases: a 
"command phase/' a "data phase," and a "status phase" with each phase using a known 
vocabulary of messages. During each phase, information is transmitted as "frames/* which arc 1 
kilobyte or less in size and include control (or header) arid data (or payload) portions. A given 
phase may involve known How control and/or handshaking and may involve many frames. For 
example, the protocol permits 64 kilobyte transfers, which could require 64 frames during the 
daia phase. The data phase uses a flow control technique in which an initiator expresses a desire 
to transmit or read a certain amount of data (e.g., in a prior write command), arid the receiver 
replies with a data request message indicating the size of data that may be sent by the transmitter 
and received by the receiver. A series of such requests may be needed to transfer the entire 
"exchange/* 

The I/O protocols rely on a concept oi virtual links connecting "virtual mainframe 
machines'" with "virtual control units. Some of the I/O protocols, which are connection oriented, 
like ESCON and SCSI, allow only one connection to be active at any moment in time, while- * 
others may actually frame multiplex ihc information among the various virtual links. Virtual 
links arc effectively identified by the frame header information specifying both physical and 
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logical addresses, and the components can dcteci virtual connections and disconnections from 
analyzing specific bits in certain frames. 

As alluded to above, commercially available control units offer "extended functions." 
Extended functions implement features above and beyond basic device operations like read or 
write. (The actual functions implemented by a device are defined in the device specification, 
such as a specification of a control unit.) For example, two popular extended functions arc 
known as "concurrent copy" and "remote copy/' which are used, respectively, for maintaining 
backup copies or for "mirroring" data to other storage as it is written to its target. Known 
extended functions operate at a physical level of addressing (e.g., volume numbers and tracks) as 
opposed to operating at the logical level (e.g.; files or the like). Referring back to Figure I, a 
control unit may perform back-up to another disk controlled by another control unit by using a 
dedicated ESCON link 572, connecting the two control units. Proprietary software (sometimes 
referred to as firmware), executing on the control units, performs the necessary operations over 
the link 172 to send the data to be backed up from one control unit to another. 

In the above approach to mirroring, data ineffectively written to the control units 
sequentially, first to the primary control unit and then from the primary control unit to the control 
unit doing the mirroring. This introduces.delay and complication as the data is written between 
the control units. The backup approach is also sequential. These approaches require dedicated 
communication links 172 that cost port connections on the control units. 

Moreover, because prior art extended functions are built using proprietary embedded 
software (also known as "firmware") to and between control units, third parties cannot 
practically create additional functions for the control units. To date, the extended functionality is 
largely limited to homogenous systems of control units. Thai is t the extended functions 
generally do not work when control units from different manufacturers are involved in a 
network. 

Clustering is similar to mirroring in that some data is effectively mirrored to storage 
associated with another processor. However, rather than mirroring information in case a 
subsequent failover or switchover to another storage proves necessary, clustering usually 
involves mirroring (or replicating ) only specific information so that the processors may act 
collaboratively and in distributed fashion. 
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Summary Of The Invention 

The inventions provides devices, systems, and methods of replicating and manipulating 
I/O in formation to improve efficiency and functionality. Preferably, the invention intercepts I/O 
information as it is transmitted between a computer (e.g., mainframe) and storage system (e.g., 
DASD storage controller). 

Under certain aspects of the invention, an intelligent splitter device includes 
a buffer memory and at least three input/output (I/O) ports. Each port includes logic for 
transmitting and receiving information on an associated input/output (I/O) communication link, 
and each port is in electrical communication with the buffer memory. Control logic controls 
each port to write I/O data received from its respective I/O communication link into a portion of 
the buffer memory associated with that port. The control logic also controls each port to transmit 
data onto its respective I/O communication link by reading data from the buffer memory at 
selectable buffer memory locations, including locations associated with one of the other ports. 

Under another aspect of the invention, the control logic can create both control and data 
frames. These frames may be created independently of any of the streams received by the 
intelligent splitter device. 

Under other aspects of the invention, an intelligent splitter device communicates 
according to a frame-based I/O protocol in which each frame includes a header component and a 
payload component. The device includes a buffer memory and at least three input/output (I/O) 
pons. Each port has logic for transmitting and receiving information on an associated 
input/output (I/O) communication link, and each port is in electrical communication with the 
buffer memory. Control logic controls each port to write I/O data received from its respective 
I/O communication link into a portion of the buffer memory. And the control logic further 
includes logic to analyze a received frame and in response thereto create and transmit a modified 
version of the received frame on one of the I/O ports. The modified version of the frame 
includes a modification to at least one of the header component and the payload component of 
the received frame. 

Under other aspects of the invention, an intelligent splitter device is used for 
communicating according to a multiphase I/O protocol having a command phase, a data phase, 
and a status phase in which the data phase has at least one data frame. The splitter includes a 
buffer memory and at least three input/output (I/O) ports. Each port has logic for transmitting 
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and receiving information on an associated input/output (I/O) communication link, and each port 
is in electrical communication with the bul'icr memory. Control logic causes at least two of the 
ports to transmit in parallel data phase information. 

Under another aspect of the invention, an intelligent splitter device is used for 
communicating according to a multiphase I/O protocol having a command phase, a data phase, 
and a status phase, in which the data phase comprises at least one data frame. The splitter 
includes a buffer memory and at least three input/output (I/O) pons. Each port has logic for 
transmitting and receiving information on an associated input/output (I/O) communication link, 
and each port is in electrical communication with the buffer memory. Control logic causes at 
least one port to, transmit control phase information stored in the buffer. In parallel therewith, the 
control logic also causes at least one other port to transmit a modified version of the control 
phase information that is transmitted on the first port. 

Using the above aspects, I/O information can be received on a first port of the 
communication device having at least three ports. The communication device can transmit the 
I/O information received on the first port to a second port of the communication device, and the 
communication device can transmit a modified version of the I/O information received on the 
first port to a third port ofthe communication device. 

Moreover I/O information can be received on a first port of the communication device, 
and the communication device can transmit a modified version of the information to one of the 
second and third ports of the communication device. Pr, at least, selected I/O information can be 
cached in the communication device, and I/O information can be received on a first port ofthe 
communication device. The received I/O information can be analyzed to determine physical 
address data ofthe I/O information, and the cached I/O information can be transmitted on the 
first port if the received information hits the cache. The received information can be transmitted 
on one of the second and third ports if the received information misses the cache. 

Under another aspect of the invention, an intelligent splitter system includes an I/O 
device and a host system. The I/O device includes input/output (I/O) port logic for transmitting 
and receiving information on at least three I/O communication links according to a standardized 
I/O protocol. Under the protocol I/O operations may be requested on storage locations identified 
by physical address information. The I/O device further . includes extended function logic that 
cooperates with the port logic to perform at least one extended function not specified in the 
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standardized I/O protocol The ai least one extended function operates in a physical address 
domain ol physical addresses. The host system has a processor and memory storing processor- 
executable instructions to map logical storage object names to a physical address domain. It also 
stores instructions to configure the I/O device with a mapped physical address domain to enable 
the I/O device to perform the at least one extended function on the physical address domain 
corresponding to a mapped logical storage object name. 

Under other aspect of the invention, a command can be received to perform an extended 
function on data identified by a logical object name. The logical object name can be mapped to a 
physical address domain, and a communication link in the I/O system can be monitored to 
determine if the link is carrying an I/O operation in the mapped physical address domain IT an 
I/O operation is being carried on the link within the mapped physical address domain, that ' 
operation can be intercepted and the extended function identified in the received command can 
be performed. 

Under another aspect of the invention, I/O information is received from one of the two 
communication links into a first multi-port device having a processor and memory. The memory 
has instructions to determine if I/O information received oh the link is within a first predefined 
set of I/O operations, and the first multi-port device is connected to a first storage system. I/O 
information is also received from a second of the two communication links into a second multi- 
port device having a processor and memory. This memory also stores instructions to determine 
if I/O information received on the link is within a second predefined set of I/O operations, and 
the second multi-port device is connected to a second storage system. If the received I/O 
information on the one communication link is within the first predefined set of I/O operations, 
the received I/O information is sent to the first storage device and an I/O operation is sent to a 
port of the second multi-port device. If the received I/O operation is not within the predefined 
set, the received information is sent to the first storage system. Similar actions to the above take 
place with respect to the second link. 

Brief Description Of The Figures 
In the Drawing, 

Figure I is a system diagram of an illustrative prior art computer system including control 
units and other devices; 
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Figures 2A-D show ihc architecture of.iniclligent splitter devices in communication wilh 
a host according 10 a preferred embodiment of the invention: 

Figures 3A-B arc diagrams of exemplary sysicms using intelligent splitters according to a 
preferred embodiment of the invention; 

Figure 4 shows a bus utilization scheme according to a preferred embodiment of the 
invention; 

Figure 5 shows a software architecture according to a preferred embodiment of the 
invention; 

Figures 6-8 show an exemplary system architecture according to a preferred embodiment 
of the invention; and 

Figures 9- 10 show an exemplary system architecture according to a preferred 
embodiment of the invention particularly .suitable for clustering. 

Detailed Description Of The Preferred Embodiments 

Overview 

Preferred embodiments of the invention facilitate the implementation of many useful data 
management applications and functions, such as improved mirroring, back-up, volume 
remapping, extent relocation, prefetching, caching, data reformatting, statistic gathering, and data 
translation, among others- 
Referring to Figure 2 A, to improve the efficiency of many of these functions and 
applications, preferred embodiments use a new, intelligent I/O stream splitter 200 that may be 
programmed to intercept and alter an I/O stream received by the splitter from a communications 
link 205. For example, in the case of mirroring, the intelligent splitter 200 may be programmed 
to intercept write commands and associated data from a mainframe 1 10 that target a specific 
storage location on a specific control unit 320. The splitter 200 may then transmit the 
intercepted I/O stream to the targeted control unit and storage location over link 206 and in 
parallel transmit on link 207 an altered version of the intercepted I/O stream to another control 
unit 330. which is responsible for holding a mirrored version of the data. The altered version 
could have the same data as that on link 206 but have new frame headers injected at appropriate 
points in the I/O stream. Transmitting the I/O streams in parallel is more efficient than the 
sequential approach used in the art (e.g., from a mainframe host to a primary control unit, and 
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from ihc primary control unit 10 a mirroring unii over a dedicated link): Not only is the parallel 
approach taster n avoids the use or dedicated inter-control unit communication links. 

Moreover, the intelligent splitter may improve the functionality of extended Junctions 
such as mirroring as well. The alterations to the I/O stream to the mirroring control unit 330 may 
include changes 10 the control information ol" the I/O stream, lor example, to address the 
mirroring control instead ol the primary unit 320. But the alterations may also include changes 
to the data itself. For example, if the mirroring control unit 330 organizes data differently (e.g., 
different size chunks) than the primary control unit 320, the splitter 200 may be programmed to 
format the intercepted stream to a form appropriate for the mirroring control unit 330. In 
addition, the splitter may be programmed to filter the I/O stream (e.g., detect and remove specific 
data patterns) or translate it (e.g., changing EBCDIC to ASCII). Thus, unlike the prior an, the 
splitter 200 permits functions to operate with multiple, heterogeneous control units and to have 
write visibility into the I/O stream itself to alter both the header and payload components of 

frames. vv - " v • -' 

Preferred embodiments further include a host 260 connected to potentially several 
intelligent splitters. The host and splitters may be programmed to implement extended functions 
cooperatively. Among other things, the host may be used to inform a splitter of the functions 
and applications that ihc splitter 200 should perform. For example, in the above case, the host 
260 might instruct a splitter 200 to watch I/O streams passing through the splitter and to take the 
above described transmission and alteration operations if the I/O stream being watched falls into 
a physical address range of interest, e.g., the range thai is desired to be mirrored. Thus, the host 
260 may be used to configure the splitter to perform the mirroring (or other) operation at a 
physical level ol* addressing. • 

In addition, the host 260 may instruct the intelligent splitter 200 to create and send 
streams independently oi any I/O streams passing through the splitter. For example, to 
implement a form of backup copying, the host 260 might instruct the splitter 200 to create and . 
send an I/O stream to a control unit 320, to read a specific track of data and to store that data in . 
the splitter 200 and/or the host 260, This created I/O stream may correspond to an entire CCW 
in ESCON contexts. Afterwards, the host 200 might instruct the splitter 200 to create another 
I/O stream to write that data to another control unii 330 that is responsible for holding, backup 
data. 
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The host 260 may also he used to communicate wiih file systems 290 and computer 
systems 280 using other communication and storage protocols. For example, the host may 
communicate with an open systems file server 280 using TCP/IP to obtain metadata about Tiles 
managed by that server. The metadata may then be used to map logical objects (e.g., files, 
directories) with physical objects and addresses (e.g., tracks and volumes). This mapping 
information may then be used to implement the above-type of extended functions and 
applications on a logical level. For example, the host 260 cooperating with a splitter 200 may 
implement caching of files by Tile names, or back-up information to an open systems file server 
280 using filenames. In addition, the host 200 may provide a communication path for alternative 
protocols. For example, the host 260 may communicate information with the splitter 200 and 
thus have access to information in control units 320, 330 connected to that splitter, but it also 
may coordinate communication with other file systems, e.g., 290, using other protocols such as 
fibre channel (FC). The host 260 may also be used to implement web-based consoles 270 to 
administer the system. 

Preferably, the splitter 200 and host 260 arc implemented with defined application 
program interlaces (APIs) to allow third parlies to develop and introduce applications and new 
extended functions into a system. By implementing the extended functions and applications in 
the splittep200 and/or the host 260, not only are new, useful functions more easily realized, they 
are decoupled from the control units. Thus, less complicated and sophisticated control units may 
be used in the system, thereby reducing ownership cost for an overall storage system. 

As outlined above, preferred embodiments may be used to improve the functionality and 
efficiency of many applications and functions. One application not mentioned above but that can 
benefit from preferred embodiments is clustering. In particular, preferred embodiments may be 
used to receive information passing between a mainframe and a control unit. The splitter may be 
used to detect a subset of this information that is of interest for a clustering environment and pass 
that information (modified or unmodified depending on the application and environment, e.g., 
type of equipment used) to another computing system in volved in the cluster. Under, this 
arrangement, the splitter may be configured to work directly with a control unit, thus potentially 
obviating the need for a device that can communicate on at least three I/O ports. Instead, only 
two I/O ports are needed (e.g., ones that can communicate according to a defined multi-phase 
protocol) and a private communication path to a control unit are needed. 
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Hardware Architecture for Intel l uLSntMUlULl' 

The intelligent spliucr 200 of a preferred embodiment is a mulliport programmable 

device that has visibility into I/O streams passing through it and that can alter the streams under 
programmatic control. As will be explained in more detail below, the splitter may receive and 
transmit I/O streams on multiple ports. In one embodiment, the streams are received into a 
buffer that a local processor may access. Programs running on the processor may then analy/x 
the streams and retransmit them on potentially multiple ports and it may potentially alter the 
stream before doing so for example by injecting headers and/or altering payloads of frames. 
Moreover, the programs may create and send I/O streams independently of any received stream. 

Figure 2B is an architectural diagram of a three port intelligent splitter 2(H) according to 
one embodiment of the invention. Intelligent splitter 200 includes Port A, Port B, Port C,a 
common buffer 220. a local processor 230, and a local processor memory 240, communicating 
via a local bus 250, 

Ports A, B, and C communicate with external links 210A, 2I0B, and 2 IOC to receive and 
transmit data over the link, for example, according to the ESCON protocol. In this fashion, Pons 
A, B, and C may participate in a communication having a chain of CCWs. This communication 
may be with a mainframe host, DASD control unit(s); a director or other appropriate devices 
depending on the configuration of the system or network; In addition, Ports A, B, and C each 
communicate with common buffer 220 via bus 214: In one embodiment, each Port has read 
visibility into the entire buffer 220, but has write visibility limited to specific buffer areas 
associated with the Port. 

The common buffer 220 is used to store frames received from a link via a Port. In 
addition, the buffer 220 may hold frames created by processor 230. 

The local processor 230 runs software in memory 240 to control the splitter 200 to provide 
"intelligence" to the splitter 200. For example; the processor may run software that can read and/or write 
state to the Ports to control their operation. In addition, since the processor 230 may communicate with the 
common buffer 220, programs may be run id read and/or write information to the common buffer 220. 
These latter programs, for example, may be us^d to analyze or alter received I/O streams or to create stream 
information for transmission (i.e., independently of a received stream). Moreover, as will be explained 
below, the processor 230 also can communicate with a host system 230 that may be shared with other 
splitter devices. 
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Local bus 250 facilitates ihc communication among the processor 230. the common 
butter 220, Port A, Port B. Port C. and the processor memory 240. The bus allows interrupt, 
command, address and data information to be passed among the various entities communicating 
with the bus 250. In one embodiment, each entiiy on the bus 250 has a unique address space, in 
thisjashion, processor 230, lor example, may write state to Port A by sending a write command 
on the local bus 250 with an address lhaiPort A will recognize as its own. 

Though many bus access schemes arc suitable, one embodiment uses a time division 
multiplexing scheme lor bus 214 in which each Port has dedicated bus time slots and in which 
reads arc grouped together and writes are grouped together, sec, e.g., Figure 4. This bus access 
approach assures that each Port will have sufficient buffer bandwidth to handle the I/O streams. 
In this fashion, each of the Ports may read data or write data only during a predefined limcslol. 

Figure 2C shows an exemplary architecture for common buffer 220 in more detail. In 
this arrangement, the unique address space of common buffer 220 is subdivided so that each Port 
and processor 230 is associated with a unique sub-address space (or their respective write 
operations to the buffer 220. Each of the Ports and the processor 230 can read data from any 
location in the common buffer 220. For example, the common buffer 220 may be logically 
divided into four equally-sized, non-overlapping, memory segments 220A, 220B, 220G, and 
220D with Port A associated with memory segment 220A, Port B associated with memory 
segment 220B, Port C associated with memory segment 220G, and processor 230 associated with 
segment 220D. For an embodiment particularly suitable for ESCON, each Port is an ESCON 
port, and the buffer 220 is at least 256 Kb in size with each Port-associated buffer segment being 
at least 64 Kb. In this embodiment, the bus 2 1 4 is capable of operating at a bandwidth of at least 
120 Mbytes/sec to support three ESCON pens. For example, the bus 214 and buffer 220 may 
have a 32-bit width and each must be capable of executing 30 Mega-operations per second. 

In one embodiment, the address space subdivision is accomplished with hardware 
assistance. Each buffer segment during write operations from a Pen (e.g., during a receive 
operation on the respective link) addresses the segment with a programmable looping buffer . 
address counter that effectively makes the Port-associated buffer segment a circular buffer for 
receive operations from a Port (i.e., write operation to buffer 220). The ' circling" address values 
corresponds to the segment's address space. As will be explained further below, software may 
write to certain registers within a given Port and '.hus control the write operations; for example, 
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software may write u> registers in a Port to indicate a starting buffer address at which to start 
writes. 

Figure 2D shows limner detail of an exemplary intelligent splitter 200. In this 
embodiment, each of Ports A, B. and C includes transmit logic 21 1; receive logic 212, and 
control logic 213. The transmit logic 21 1 is primarily responsible tor sending data to an external 
link; the receive logic 212 is primarily responsible lor accepting data from an external link, e.g., 
210A; and the control logic 212 is responsible tor coordinating the actions of the transmit logic 
21 1 and the receive logic 212. This coordination may be controlled by software executing on 
local processor 230, as further described below. 

As stated above, the transmit logic 21 1 is primarily responsible lor sending data to an 
external link. In one embodiment, the transmit logic 21 1 operates in response to control registers 
in control logic 213. The frame information to be transmitted (both header and at least some 
payload) reside in buiicr 220. (Only some of the payload need be present to implement 
streaming, as explained below). The frame information may be identical to that received from 
one of the Ports and stored in the buffer 220, or it may be modified by software from such frame 
information, or it may be independently created by software. By setting a "start transmission" 
bit in a control register (not shown), the transmit logic 21 1 begins to transmit a frame(s) onto a 
corresponding link. In particular, the transmit logic reads data trom the buffer 220 starting at an 
address loaded in a "frame header starting address" register, and as the data is read, it is 
transmitted through the transmitter logic onto the link. This reading and sending Ls for a 
specified amount of data (e.g., ESCON has 9 byte headers). Afterwards, the transmit logic 2 1 1 
reads and transmits data from the buffer 220 starting at an address loaded in a "frame payload 
starting address" register for a specified amount of data (e.g., indicated in a length register). 
Once the frame Ls sent, the transmit logic 2 1 I sehds an interrupt to the processor 230. One 
embodiment of the transmit logic 2 11 includes scatter/gather logic: In this fashion, the data to be 
transmitted on a link need not be in contiguous space in buffer 220. Thus, this logic facilitates 
transmissions in which a frame header and a- payload- are in discontiguous space in buffer 220, 
and may facilitate the support of protocols having virtual links. 

As stated above, the receive logic 212 Ls responsible for accepting data from an external 
link, e.g., 210A. As data is received from a link, the receive logic 212 writes the frame data into 
the buffer 220 at a starting address specified in a control regLster (not shown). The starting 
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address of the control register may be sci by software As writes occur, the write address register 
increments circularly corresponding to that Pon s address space in butter 220, and a byte length 
value is incremented accordingly. The receive logic 2 i2 includes interrupt logic to signal data 
transfer events to processor 230. One embodiment allows the software to select the type ot'daia 
transfer event to cause an interrupt by selling appropriate bits in a control register. For example, 
an interrupt may be sclcctably sent when a frame has been received without error, or an interrupt 
may be sclcctably sent when a specified byte count has been reached (e.g., to signal the reception 
of a frame header or to signal that an appropriate amount of a data frame has been received to 
begin data streaming, as explained below), in addition, the receive logic can distinguish between 
data frames and non-data frames being received. This distinction may be conveyed as 
distinguishable frame receive interrupts or by holding pollable state indicating the type of frame 
received. In one embodiment, the receive logic may be configured to send an interrupt when a 
tirsi data frame is received, and/or to send an interrupt when all data in an exchange has been 
received. (These interrupts arc described in further detail in a later section.) One embodiment of 
the receive logic 212 implements write overflow protection to detect if an initiator has attempted 
more data than it was permitted and to prevent overwriting in such case. Depending on the 
implementation^ the receive logic 212 might perform other functions such as generating parity or 
ECC bits for the data written to buffer 220. 

The control logic 213 includes the various registers described above to control the 
transmission and reception processes. The various registers may be mapped to their own unique 
address space and may be accessed by software executing on processor 230 via bus 250. (These 
control features arc also described in a later section.) The control logic also includes logic for 
detecting and generating interrupts as described above and for coordinating the access onto bus 
214 and bus 250. 

Many physical implementations of the above logic are realizable. One embodiment 
implements the logic for the three Ports A, B, and C, on a single Held programmable gate array 
(FPGA) chip; uses an Intel i960 processor chip or aMotorola PowerPC processor chip for 
processor 230: commercially-available memory chips for memory 240, and commercially- 
available dual-poned storage lor the common buffer 220. Ports may access the buffer 220 via 
one buffer port, and the processor 230 (and potentially a host 262 or DMA Engine 266) accesses 
buffer 220 through another buffer port. Preferably, Ports A, B, and C and the corresponding 
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links arc each lull duplex pons The spliticr 200 also includes other known hardware (noi 
shown) 10 assist in reception and transmission, such as delimiter bit detection and insertion, CRC 
processing and insertion, electrical to optical transceivers, parallel to serial conversion and the 
like. In addition, the Pon logic includes mechanisms lor detecting protocol-specified error 
conduions and tor sending interrupts in such cases. 

As outlined above, the Ports and other aspects of the splitter 200 arc visible on bus 250 
and thus to the processor 230. In addition, the components may communicate with the processor 
230 via interrupts and associated polling. The handling of such interrupts and the setting of 
appropriate Port state is the responsibility ol soli ware, in a preferred embodiment. Software 
executing on the splitter 2(K) is responsible for at least initial action on handling interrupts, but 
thus software may cooperate with other software on the splitter and/ or the host system 260, as 
described further below. 

Figures 2A-D show that exemplary intelligent splitter 200 may communicate with a host 
system 260 via a bus 268. This communication path may require separate logic 270, for example, 
to bridge the bus 268 with the local bus 250 of the intelligent splitter. Alternatively, the bridge 
logic 270 may be integrated into the processor 230 as is known in the art. Figure 2D illustrates a 
DMA Engine 266 separately as a convenient descriptive shorthand to show the relevant 
functionality; this logic may be separate, integrated in bus bridge hardware 270 or provided by 
the local processor 230. 

Although the host 260 is not part of intelligent splitter device 200, the intelligent splitter 
supports such communication so that more feature rich systems may be constructed by 
combining the splitter with a host 260 and possibly with other splitters, e.g., see Figure 7 
described below. The path 268 allows a host system to, among other things, read data from, and 
write data to, the common buffer 220 and the processor memory 240 either directly or indirectly. 
In this way, the host 260 may load software for the processor 230 to execute, may establish 
configuration parameters, and may pass messages to the processor 230. Though not shown jus 
such in Figure 2D, the host processor 262 and memory 264 arc pan of a complete host system 
260 that, among other things, includes non- volatile storage and a network connection(s)„see, 
e.g.. Figure 2A. 

Software Architecture 
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Under a preferred cmhtxJimcm, the splitter 200 operates on ; I/O streams at a physical address 
level, and the host 260 is responsible lor implcmcming. functionality at a logical level. Under this 
arrangement, ihc splitter 200 is configured by the host 260 to lake certain actions when I/O 
streams fall within prescribed, physical address domains (e.g., physical volumes, tracks, sectors, 
and physical blocks). This specification of a physical address domain and associated actions to 
be performed is called a "session definition." For example, with regard to a mirroring function, 
a session may define that all I/O streams received on Port A occurring within a specified physical 
domain (i.e., the domain to be mirrored) should result in that identical stream being sent on Port 
B (e.g., connected to the primary control unit 320)i and thai an altered stream be sent on Port C 
(e.g., connected to a control unit responsible for holding mirrored data); the alterations to be 
made to the stream lor Port C would be defined to include specific changes to control 
information to reflect the different address to be used for the mirrored data. 

In a preferred embodiment, the host 260 configures the splitter by communicating 
session definitions to the splitter 200. However, the hosL, unlike the splitter, may be 
programmed to understand a logical level of addressing abstraction (e.g., databases, file systems, 
logical records) and can map logical-lcvci objects to a corresponding physical address domain. 
Thus, the host can configure the splitter to implement extended functions at a physical level, but 
the host may be used applications and users to provide functionality at a logical level. For 
example, an administrator may instruct the host to mirror a file. The host would map this logical 
object to the physical domain and configure an appropriate splitter accordingly. 

Under a preferred embodiment, the software on the host and splitter may cooperate to 
send CCWs or a chain of CCWs on a link. Thus, the combination may be made to mimic a 
mainframe host or control unit. Thus ability to create and send CCWs is useful in implementing 
data management functions and applications. For example, with regard to the mirroring 
example, CCWs may be used to perform reads from the primary control unit and writes to the 
mirroring control unit to synchronize their relevant state before a mirroring session definition is 
invoked. 

By providing a defined API to the splitter functionality, third parties may implement 
useful applications and extended functions to operate on the system. In addition, by including 
logic to cooperate with agent technology (e.g., on file server 28()); the system may work with 
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"open systems*' and file systems oi disparate operating systems. In this fashion, the system may 
effectively blend of merge storage in disparate tile systems and from disparate environments. 

Figure 5 shows a more detailed diagram explaining the software architecture. It includes 
a link interface 505, an intelligent splitter subsystem 510, a host/splitter interface 525, and a host 
system 520. It also includes a web server subsystem and an agent module subsystem (not shown). 

Splitter/Port (or Link) interface 

The splitter/Port interface software 505 controls the link hardware (i.e., the Ports) ol the 

splitter 200 and is used by the splitter subsystem 5 1 0, described below. This interface is used for 

the transmission and reception of frames on splitter Ports. The link interface implements 

interrupt service routines (ISRs) and cooperating software to respond to and handle interrupts 

from the Pons. The ISRs cither process the interrupt internally (when a small amount of 

processing time is needed), or post requests and parameters on a system queue for later execution 

under the control of the task execution component 511, explained below. 

in one embodiment, data transmission of frames operates according to one of two 
transmission modes: ( ! ) data is transmitted on one Port, and (2) data is transmitted on two Ports. 
To implement these modes, link interface routine(s) may be used to set the appropriate state in 
the transmit logic to point to data to be transmitted and to set the appropriate start bits. As 
described above, this will cause the transmit logic to read data from buffer 220 and transmit it on 
appropriate Pons. The actual data sent will depend on the application state; e.g., it could be a 
received frame, an altered version of a received frame, or I/O stream data that was created by the 
splitter and host, as explained below. In one embodiment, the link interface 505 includes an ISR 
and/or cooperating software to respond to interrupts indicating that a transmission has been 
completed (see splitter description for how this interrupt is sent)- This software may then 
perform various memory management functions to monitor the free, usable space within buffer 
220. For example, such an interrupt may indicate that the space in buffer 220 that held the data 
just transmitted may now be used for other purposes. The link interface includes ISR(s) and 
routine(s) to handle any transmission errors generated during a transmission. 

In a preferred embodiment, the link interface 505 uses the hardware s ability to 
distinguish between the reception of non-data frames and data frames into the splitter 200. This 
distinction may be exploited to make the software operate more efficiently and to coordinate the 
reception and transmission processes. 
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When a non-data frame is received, a Port creates and sends a corresponding interrupt to 
processor 230, which causes a corresponding ISR and other link interlace software to be 
invoked. Thus, lor example, if frames for a command phase of an I/O stream arc received hy the 
splitter, this software will be invoked. Likewise, this software is triggered by a status frame, a 
command received frame, and the like. The link interlace reception software may then analyze 
the received frame data (e.g., by indexing to specific bits) to determine the precise control 
information conveyed by a mainframe, control unit, director, or the like. This might be useful to 
track the state and status of CCWs or CCW chains or to implement specific features of extended 
functions. For example, this might be used to verify that an acknowledgment frame is being 
returned in response to- a command frame and to analyze the contents of the acknowledgement. 
In addition, it might be used to implement a caching function, for example, by analyzing a write 
address to determine if it hits information cached in memory of host 260. 

When a data frame is received, a Pon.. may create and send a corresponding interrupt to 
processor 230, though not necessarily. More specifically,- in one embodiment. Ports generate an 
interrupt when a first data frame of an exchange is received but not subsequently for that data 
exchange. This interrupt, similarly to that described above, causes an ISR and cooperating 
software to be invoked to handle the first data frame event. Since each frame of an exchange 
may be handled identically, there may be no need to distinguish between the multiple data 
frames of an exchange or to handle the multiple frames individually as events. 

Link interface software may respond to this first data frame event in several ways. For 
example, the software may use this event to initialize transmit logic on specific Ports as called 
for by the application (more below) so that the transmii logic may be subsequently triggered into 
action to transmii frame data(for example, after an interrupt is received indicating that a specific 
amount of frame information has been received by a Port). Alternatively, the link interface 
software may both initialize the transmii logic and trigger it,, if a first -frame of received data is 
determined to be sufficient to start a transmii process. 

As alluded to above, before a data frame is received, the software has already received 
non-data frames, and thus has had an opportunity to obtain information about the data frames and 
exchanges to be received subsequently. For example; a write command frame, a corresponding 
write command acknowledgment frame, and a data request frame each may convey useful 
information about a subsequeni exchange of data frames. From ihis type of information, the 
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software can determine what actions should be taken (more below) and can determine whether 
particular data transfer modes may be appropriate. 

In one embodiment, before data frames actually arrive at the splitter; the software has set 
slate in the splitter Ports to correspond to one of three data transfer modes. In a fust mode, Port 
state may be set so that data frames are received only in the splitter's buffer 220. Thus might be 
useful for functions and applications in which the data needs to be translated, filtered or 
formatted before being transmitted on a Port. To implement this mode, the software may 
initialize the receive logic of the receiving Port for the reception of data frames The data will 
then be received in the buffer segment corresponding to the receive Port, and the software will 
copy and modify information as appropriate before sending it out on a transmit Pon under the 
control of software, for example, from the software on host 260 (more below). In a second 
mode, Port state may be set so that data frame data is received in the splitter's buffer 220 and 
also subsequently sent on a Port. This might be useful when a control unit is to receive an 
unmodified version of a data stream. A modified version of the frame data may be created from 
the received frame and sent subsequently to a different- Port, similarly to thai described for the 
first mode. For example, this mode may be useful in the mirroring example discussed above, in 
which a primary control unit received an unmodified stream, and a mirroring control unit 
received a stream that had been altered: To implement this mode, the software may initialize the 
receive logic of the receiving Port as explained above to receive data, and it may initialize the 
transmit logic in another Port so that that Port isready to start transmission of data. In a third 
mode. Port state may be set so that data frames are received in the splitter's buffer 220 and also 
subsequently sent on two outgoing splitter Ports. This might be useful when two control units are 
to receive an unmodified version of a data stream, e.g., a mirroring example in which the data 
frames are unaltered. The software would operate similarly to that for the second mode, but 
would initialize the transmit logic for two splitter Ports, as opposed to one. 

In any of the above modes, the receive logic may be initialized to generate an interrupt 
once a portion of the whole exchange is received (a data breakpoint). For example, this may be 
useful in the second and third data modes to implement a form of data streaming that reduces 
data latency an the buffer 220. By initializing the Port states with a data breakpoint, once the 
software receives a breakpoint interrupt; the software may then begin transmission of the data . 
frames on the other Port(s) by setting; appropriate "start bits" in the transmit Ports, even though 
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the enure frame or exchange 10 be transmitted is not yet tally received in the buffer 220 from the 
receiving Port(s). Data sircaming like this works because the receive logic and transmit logic 
both operate at protocol defined speeds. Thus; the transmit logic is assured of always reading 
valid data from the buffer 220. As outlined above, the software may initiate data streaming after 
a first data frame is received, or the software may be configured so that data streaming is 
initiated only after a specified amount of information has been received. Data streaming is 
advantageous in certain situations where the data to be transmitted is identical to data received 
(sec, e.g., mirroring application discussed above). 

The software also provides ah ISR to respond to an interrupt generated when an entire 
exchange has been received. This might be useful when an entire exchange needs to be modified 
or processed in some way. 

Intelligent Splitter Subsystem 

The intelligent splitter subsystem software 510 runs on each intelligent splitter 200. In 

one embodiment, the intelligent splitter subsystem 510 runs under the VxWorks real-time, 

operating system, which is responsible lor providing known, basic system services for the 

subsystem 5 10. The prime functionality ofsubsystcm 5 10 is to provide data and control access 

to mainframe storage and other devices connected to one of the Ports of a splitter device 200. 

This subsystem primarily operates at a physical level. It includes a task execution component 

51 1, a channel component 5 12, a filter subsystem 513, a formatter and translator subsystem 514, 

and a diagnostic subsystem 515. 

The task execution component 511 is responsible for executing the logic of the splitter 

subsystem 510. All the other components implement service routines that are invoked by the task 

execution component 511. 

Task Configuration and Execution 
The task execution component logic 5 1 1 is triggered by one of the following events: ( I) a 

service request from the host 260 through the host/splitter interface 525, described below (2) indirectly as 

a result of an interrupt from a splitter Port, processed by the Link interface (in this situation, the ISRs are 

foreground components, and the task execution component is a background component ): and (3) timer 

interrupts. 



- 19- 



BNSOOCIO: <WO 0101257A2J_> 



WO 03/01257 



PCTAJS00/17760 



As outlined above, the spline: is first configured through a host 260 10 define sessions, which 
associate prescribed physical address domains with certain actions that should be taken by the splitter 
software for I/O strcanis faHimj within ihc domain. More specifically, a session deli rut ion is established 
with multiple messages sent via the host/ splitter interface 525. One message updates internal data 
structures in splitter memory 240 to define the subject of the extended function at issue. The 
data structure includes information about (a) the e.ontrol unit address (b) the device address, and 
(c) the set o! tracks for which the extended function is to be. performed (for ESCON embodiments, or a 
set of blocks for a SCSI embodiment). This information may be stored in spltticr data structures and 
later used by the software lo determine whether an I/O stream received by ihc splitter falls into a 
registered domain. The host 260 also sends a message to the intelligent splitter 200 that identifies 
the logic to execute for the specific subject. In one embodiment, the following terms define the 
logic to execute: 

L The targct(s) for data is specified (for the case of data redirections or data 

duplication). Targets can be another mainframe disk location; a host memory 264; 
splitter memory 220; or an open system disk. 

2. The transformations and the reformatting to be performed on the data for each data 
transfer arc specified. These refer to "formatters and translators" routines within the 
formatter and translator subsystem 514 to be applied on the data. In addition, the 
order of application of these routines is specified. 

3. Patterns within the data stream to be detected by the filter subsystem 5 1 3 that should 
trigger a notification to the host are specified. 

4. The conditions under which task execution component 5 10 has to notify the host 260 
of its execution state are notified. These include notifiy and wait and notify and 
continue operations. 

The session definition also includes host breakpoint information that identifies a trigger 
condition or a specific sequence of events that activate the breakpoint. The breakpoint 
information also identifies what action should be triggered when the breakpoint is detected; e.g., 
notify the host 260 and continue, or notify the host 260 and wail for a reply. The session 
definition also includes context information, which links to a process running on the host 260 
that initiated the defined session. 

Thus, using the above types of message, extended functions may be implemented as a 
defined domain and ordered list of operations and targets for an I/O stream. The action list of 
operations defined is a session definition is treated as a pipe; that is, the input to the entire action 
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list is the entire daia range. Some of ihe session definition components may imply further 
processing. For example, spec i lying an open systems disk may imply processing to send the 
information to a particular fibre channel target, and would imply various forms of negotiation 
and processing to send the data. 

Alter the splitter has been configured with session definitions (potentially lor many 
domains of interest) it is ready to start intercepting I/O streams and taking defined action in 
response to interrupts from the Ports. When the task execution component 5! I is invoked by the 
splitter Port, e.g., via Port interrupts to the link interface, the task execution component 51 1 
consults session definition data structures to determine whether the received frame information 
corresponds to a registered session definition and if so it performs the actions defined in the 
session definition. The specified actions can refer to translation and reformatting operations and 
filters provided by splitter components described below. 

Session definitions may be created and used in response to operator commands or the 
like. These commands may be communicated via a console 270 (sec, e.g., Figure 2A). 

Channel Component 

The channel component 512 implements logic to send CCWs or chains out one of the 
splitter Pons. In short, this component mimics the operations that might otherwise result from a 
mainframe host or a control unit (i.e., issuing a CCW chain). The channel component 512 
includes logic for interacting with a host 260 which initiates this component via a corresponding 
API (described below) that sends "service requests." It also includes logic for implementing the 
state machine corresponding to a CCW or chain, and for complying with the ESCON protocol 
for a channel as defined in the standard, to send such CCWs. 

More specifically, the host 260 may send a message over the bus 268 using interface 525 
to request that the splitter 200 perform additional data transfers and operations. The message 
specifies a service storage domain, for example, identifying the storage area on which the service 
is to operate. The storage area is defined in physical terms; for a particular device, a set of 
extents for ESCON or a set of blocks for SCSI. The messages also specify the direction of the 
transfer, the sub ject of the transfer, and the location of the data within buffer 220 from where, or 
to where, the data should be transferred. The service request also includes data processing 
information. Like the case of a session definition, the data processing information defines the 
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processing lhai the spliucr should perform on the data prior to the data bcin^ routed to the target. 
This information is in the form of an ordered list of data filters, transformation and translations 
that the splitter implements. The service request also includes breakpoint and context 
information analogous to that of the session definition. In this case, the task execution 
component 510 is invoked by a "service request** from the host 260 which specifics the above. 
For example, the host may make a service request to the splitter to cause the splitter to read a set 
ol tracks from a device into memory. In processing the service request, the software may create 
the necessary CCW frames (both header and payload of various types) in a buffer segment 
associated with processor 230 and then transmit those frames over the specified port. 

Service requests may be created and used in response to operator commands or the like. 
These commands may be communicated via a console 270 (see, e.g., Figure 2A). 

Filters . . 

The filter subsystem component 513 provides capabilities for inspecting the content of 
data Ilowing through the intelligent splitter 200 and for testing whether prescribed patterns exist 
within the data. The routines provided by this component may be referred to in session 
definitions or service requests. The patterns to be tested are defined by the host and 
communicated to the software executing on the splitter 200 through the host/splitter interface 
525. When a pattern is detected the splitter software, a nptifying message is sent to the host 260 
and action may be taken. For example, a filter may remove a specified pattern from a data 
stream. Filters that may be used include those found in data mining and database technologies, 
though the architecture permits the specification of new filters. 

Analogously to filters, the formatter and translator component 514 provides formatter and 
translator routines that may be referred to in session definitions or service requests. For 
example, a formatter may include logic to format blocks of one size to another, and a translator 
may translate data from one form (e.g., EBCDIC) to another (e.g., ASCII). Akin to situation 
with filters, the formatters and translators used may include those found in data mining and 
database technologies, though the architecture permits the specification of new formatters and 
translators. This functionality might be used at higher levels of abstraction as well, for example, 
by utilizing metadata from a file system to reformat database records. 
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Diagnostics 

The diagnostic subsystem 5 1 5 is responsible for executing test programs. 
Host/Splitter interlace 

Host/splitter interface software 525 defines a mechanism for transferring data and control 
between the host 260 and a splitter 2()0. The interface software 525 facilitates the control of the 
splitter's work, shares information hci ween the splitter 200 and the host 260, synchronizes 
content of corresponding data structures across hosL/ splitter boundaries, and synchronizes 
execution How between the tasks executing on a host and the tasks executing on a splitter. 
Among other things, this interface helps the host 260 configure a splitter 200 and request 
services from the splitter. 

In one embodiment, interface 525 uses messages to provide communication between the 
host and a splitter. Messages arc sent over the bus 268 by a PCI bus driver, and the splitter and 
host communicate according to a defined splitter interface API . 

PCI Driver 

The PCI driver manages a physical communication layer corresponding to the bus 268, 
e.g., a PCI bus. (The term PCI driver is used herein to refer to a preferred implementation for bus 
268, though the invention is broadly' applicable to other arrangements including other bus 
architectures.) The driver implements general message passing between the host 260 and the 
splitter 200 over the bus 268. in one embodiment, the data transfer mechanism between the host 
260 and the splitter is asymmetrical. This provides one way to help avoid deadlock. All intra- 
platform message buffers (i.e., messages between the host and splitter) are maintained only in the 
splitter's processor memory 240 as two circular message buffers. One buffer is for incoming 
messages (to the splitter from the host), and the other is for outgoing messages (from the 
splitter). In the latter case, the host will cause a copy of the message to be made in the host's 
memory 264. Only the host directly accesses the splitter memory 240. The splitter 200 never 
accesses the host memory 264. That is, only the host will initiate DMA transfers between the 
host and splitter, though the host will instruct the splitter to do the DMA on the host's behalf. 

Depending on the data transfer length and the destination of the data transfer, the PCI 
driver implements one of the following transfer methods. For small payloads (i.e., less than 128 

- 23- 



BNSDOCID: <WO 01O1257A2_l_> 



WO 01/0*237 



PCT/US00/17760 



bytes) mailbox messages arc used. In ihis case, ii is assumed thai the daLa conslilulcs a message 
and not data lor populating a particular data structure. The data will be directed to the next 
available circular buffer. For data longer than 128 bytes, it as assumed that the data populates a 
particular data structure. In this case the DMA mechanism will be used, and the data will be 
**DMA"cd directly to the data structure in question.. 

The PCI driver interlace is defined with the following functions: 

1 . PCpciSendMessage (): This function loads the next free circular incoming buffer 
and triggers a mailbox interrupt on the splitter. This function returns control after 
generating a mailbox interrupt to the other side. 

2. ESpciSendMessage (): This function loads the next free circular outgoing buffer 
and triggers a mailbox interrupt on the host side: This function returns control after 
generating a mailbox interrupt to the other side. 

3. ESpciFreeMessage (): This routine marks a message buffer as free. This 
function returns control right away after initialing the DMA operation. 

4. ESpciDMARead (): This function DMAs data to the splitter from the host using the 
splitter's DMA mechanism 266. This function returns control right away after 
initialing the DMA operation. 

5. ESpciDMAWrite (): This function DMAs data from the splitter to the host using the 
splitter's DMA mechanism 266. 

6. ESpciMessageReceived (): a mailbox ISR invoked on the splitter 200. The 
routine will invoke task execution component to process a received message. 

7. PCpciMessageReceived (): a mailbox ISR invoked on the host 260. When 
this function returns control the software may assume that the message buffer has 
been freed, Le., that the host has already copied the message to its memory as 
outlined above. 

8. ESpciDM&complete (): a DMA ISR called when the DMA operation completes. 
This routine will cither wake the task that initialed the DMA operation (for 
synchronous operations), or invoke a callback routine defined by the invoking task 
(for asynchronous operation). 

The driver maintains the buffer pointers (next free buffer and next available message), 
DMA operation is executed explicitly or implicitly only on the host's request. It will be 
executed implicitly by sending a message using PCpciSendMessag requesting data transfer from 
a specific location in one address space (host/splitter card) to a specific location in another 
address space. It will be executed implicitly as pari of an operation requested by some other 
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message type (execute CCW chain lor example)., In any case, the splitter may assume that, 
whenever it docs DMA to/from memory locations in the host 260, it has exclusive access to 
these memory locations. 

Splitter Interlace API 
The splitter interlace API defines the interlace through which applications access the 

functionality provided by the splitter 200 and splitter subsystem 5 10. (By defining such an API, 

third party developers may create applications and functions lor the system.) From the 

application's perspective, the splitter provides the functionality defined by splitter interlace API, 

and all of the lower-level splitter details arc hidden from the application s view. 

The basic method of interaction using thus interface is as follows. The host stilt ware uses 
routines from the API to communicate with the splitter. The routine sends a corresponding 
message to a relevant splitter using the PCpciSendMcssage routine of the bus driver interface. 
This routine passes the host's request along with the necessary, corresponding parameters to the 
splitter. Each request has a corresponding lypc and message format understood by the software. 
When the splitter 200 completes the processing of the request, the splitter 200 sends a message 
back with the completion status and possibly with additional data to satisfy the request. Some 
API routines execute synchronously, meaning that the host routine making the call blocks from 
further execution until a reply message is received from the splitter. Some API. routines operate 
asynchronously (i.e., non-blocking). In one embodiment, asynchronous routines operate in 
conjunction with a callback routine that should be invoked when a response message eventually 
arrives from the splitter. 

The interface API may be categorized based on the functionality the routines provide. 
For example,one embodiment provides an API having routines for (1) defining sessions; (2) 
service requests; (3) breakpoint/notification requests; (4) querying and setting the current state of 
a splitter; (5) receiving and/or sending data update requests; and (6) resources allocation 
requests. 

Session Definition: These API routines are used to establish session definitions in the 
splitter 200. . As outlined above when describing the task execution component 51 1, a session 
definition defines a scope of data for which the splitter 200 has to execute some specified work. 

In one embodiment, the API uses the following lypc of messages to establish session 
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definitions on the splitter. A ScsskmStailMsg defines all the parameters of a session as specified 
above and activates the session. This message may be sent by the host 260. A 
ScssionManagcMsg execute management operations on an existing session task (e.g., query the 
session's state or cancel the session). A Sessions tatusMsg reports status ot an existing session, 
cither a reply to ScssionManagcMsg, or as a result of some unrecoverable error state in the 
splitter. 

Service Request: These API routines are used by the host to request a service. A service 
request is associated with a CCW or CCW chain ol granularity. Under some embodiments, a 
host 260 may issues service requests corresponding to some CCWs of a chain to one splitter, and 
other service requests corresponding to other CCWs of the chain to another splitter. When a 
splitter finishes processing the service request it sends a completion notification message back to 
the host. 

This API uses the following messages to implement service requests: 

1. ChainsStartMsg: defines all the parameters of a service as defined above and starts its 
execution. 

2. ChainsManagcMsg: executes management operations on an existing service task 
(e.g., query the service s state or cancel the service). 

3. ChainsStatusMsg: reports status of an existing service, either a reply to 
ChainManagcMsg, or as a result of* some unrecoverable error state in the adapter. 

4. ChainContinueMsg: sends a particular frame type in a pending chain. For example, to 
keep a domain locked, an appropriate breakpoint is set for the domain-end, causing 
the adapter to delay the status- ACK frame of the last CCW in a domain until a 
ChainContinueMsg is received. 

Breakpoint/notification request: These routines are used to establish breakpoints and to 
communicate associated notifications. This mechanism allows the splitter software and the host 
software to cooperate as a distributed application, to facilitate certain features like recovery 
procedures, and to support protocols having frame multiplexing. As outlined above, a breakpoint 
specifics conditions at which the splitter should inform the host of the breakpoint event. The 
splitter informs the host of such an event through a notification message. In one embodiment, 
there arc two methods of defining and setting breakpoints/notification request in the splitter code. 
One method includes the information as part of a session definition or a servicd request. The 
other explicitly uses a breakpoint/notification request. 



- 26- 



BNSDCCID: <WO_0101257A2_L> 



WO 01/01257 



PCT/US00/17760 



In one embodiment suitable lor ESCON environmcnts v hreakpoints are classified into ihc 
lb Mowing groups: 

1. Chain events. These include Start of chain events. End of chain events. Stan of 
domain events and End ol domain events. These events help the software 
cooperatively track the state of chains ol'CCWs and the like. 

2. Data related events. These include "data ready" events, indicating that a data read 
condition is satisfied, e.g., after a certain amount of data was received; "buffer free" 
events, indicating that a data write condition satisfied, e.g., after a certain amount of 
data was sent; "data transformation start*' events sent just before beginning data 
processing as defined by the corresponding session/service; "data transformation end" 
event sent just after data ending data processing as defined by the corresponding 
session/service. - 

3. ESCON protocol related events. These inciudc the following: 

• After ESCON CCW received 

• Before ESCON CCW transmuted 

• After ESCON Status received 

• Before ESCON Status transmitted 

• : ESCON disconnection 



• ESCON reconnection 

• After ESCON frame received 

• Before ESCON frame transmitted 

• ESCON sense data 

4. Data format update events. These indicate, for example, that the splitter read a track 
with previously unknown format 

5. Error condition interrupts. These include task completion interrupt, ESCON link 
failure interrupt, general error condition interrupt, timeout interrupt, and shutdown 
interrupt. 

The following, messages arc used in association with breakpoints and notifications: 

I; , BrcakpointSctupMsg: adding, modifying or canceling a single breakpoint type for an 
existing session or chain task. Usually used as a response to a breakpoint, to set up the 
next breakpoint in the expected sequence ol events. 
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2. DomainChcckMsg: breakpoint message from ihc adapter on a domain start. This 
requests information from the host regarding the relevance of the associated ESCON 
extent to any active sessions on this splitter. 

3. TracksFormalMsg: used by the splitter to transfer ESCON track format information 
to the host's cache (after a Read-Track or a Format- Write), or as a host's reply to a 
breakpoint (e.g., DomainChcckMsg) to update the track-format cache on the splitter. 

4. DomainStatusUpdatcMsg: either a reply to DomainChcckMsg, or an update to 
information that is cached on the splitter, regarding the relevance of specific extents 
to the active sessions on the splitter. 

5. BlocksMapMsg: used by the host to update the offsets of data in a buffer that is later 
transferred to a SCSI device (i.e., a partial information about ESCON-SCSl 
mapping). This is a reply to DomainCheckMsg 

6. DisconncctionMsg: breakpoint message for a disconnection. Used by the splitter to 
transfer the status of the disconnected chain to the host, so it may be resumed on 
rcconncction. Used by the host to transfer the same status to the splitter that receives a 
rcconncction (or a negative response in case the rcconnection is refused). 

7. RcconncctionMsg: breakpoint message for a reconncction. The splitter requests a 
negative or positive reconneclion reply from the host. 

8. DataReadyMsg: breakpoint message I rom the splitter after the data is ready on it (i.e., 
the data was read from an ESCON device or received from a host), or a breakpoint 
reply from the host to the splitter after the data from a SCSI device or from the host 
was transferred to a buffer on the splitter. 

9. BufferManageMsg: breakpoint reply from the host to assist in the buffer management 
on the splitter. Informs the splitter of what data was read by the host (or the SCSI 
card). Virtual buffer identifiers are used to mask the splitter's implementation details. 

Query and set the current state of the splitter The API for querying and setting the 
current state of the splitter sets various state variables on the splitter like putting splitter in debug 
mode, for example, or selling Port slate in splitter software to indicate that the Port is offline. 

Receive/send data update request: The API for receiving and sending data update 
requests. This is used to get an update, or send an update of some shared data, like update format 
track table, for example, or to inform a splitter of a control unit to which it is connected. 

Allocation of resources request: The API for allocation of resources sends messages to 
allocate resource and facilitate resource management, e.g., avoid deadlocks. This may be used by 
the host to allocate and lock some splitter resources like a Port or buffer space on the splitter, 
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needed lor the execution of one of the host's tasks requested via a service request. This request 
is not exposed to. the third party APIs, and is instead used internally hy other request types. 

Host Subsystem 

As shown in Figure 5, the host subsystem 520 includes a web interlace component 53 1 , 
an interrupt handler 532, file system mappers 533, system management 534, a Fibre Channel 
(FC) subsystem 535, system services 536, and clustering services 537. These components and 
services in turn may be used by host applications 540 and the other host components to 
implement data management applications and functions (e.g., mirroring, backup, caching, etc.) 

The web interface component 53 1 provides an interface to a web server subsystem 270 thai 
allows a web server to facilitate servicing, managing, and debugging of the sysicm via a remote web- 
bascd console. 

The interrupt handler 532 handles breakpoints and assists with their dispatching. 

The file system mappers 533 include logic to map logical objects into their corresponding 
physical addresses. These physical addresses may then be used in the construction of session 
definitions and service requests. In one embodiment, the file system mappers cooperate with 
agents on other systems. For example an agent may execute on a file server 280 operating under 
another operating system (e.g., LINUX, NT, Solaris) and thus other file server may be 
effectively merged into the system using the host 260. For example, the file mapper may 
cooperate with such agents to request physical and logical level information from such file 
servers (Le., metadata) or to perform other operating system functions on the host behalf (e.g., 
open a file on the server). The host may use metadata to implement logical level functions and 
operations in conjunction with a splitter, e.g., mirroring on a logical level. 

System management 534 is responsible for performing administrative functions, e.g., 
configuring the system, shutting down devices, and the like. 

Fibre Channel (FC) subsystem 535 is responsible for communicating with components 
290 over a fibre channel protocol. 

System services 536 is responsible for providing and interlacing with operating system 
services. 

Clustering services 537 is responsible for arranging hosts 260 and splitters into clustered 
arrangements to improve connectivity and reliability. (This clustering Is not to be confused with 
mainframe-level clustering described below.) The types of connections supported arc 
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conventional. The services, among other things, provides software to synchronize >and help keep 
coherent data structures among hosts 260. This includes managing a virtual memory space 
shared by potentially multiple hosts 260. 

Applications 540 are data management applications and extended functions that may be 
used in the system. These include applications that may be developed by third parties. The 
applications use the functionality as defined in the host subsystem 520 and the splitter interface 
described above. That is, applications may be developed to use the defined API of the host, and 
the splitter, as described above. 

At a high-level of abstraction, all applications 540 perlorm the following. Each 
application pcrlorms some application-specific tasks that may require the use "of file mappers 
533, system services 536 and the like. The applications cause the host 260 to sicnd messages to 
the splitter 200 to define sessions and/or to request services. If the application needs to interact 
with software running on the splitter 200, the application software cither directly or indirectly 
needs to establish breakpoint conditions in the splitter software logic, at which the host 
application desires to be notified. The host application may call a system service that will cause 
the application to block until an incoming breakpoint message is received for that application, 
and that will cause the application to unblock (or wake-up) upon reception of the message. The 
application may then access a corresponding buffer that includes status information and the like 
about the breakpoint. The application may then react (or "handle") the breakpoint in an 
application-specific way, for example, by "switching" on the breakpoint type information and 
calling an associated callback routine for the type. Once done with such handling, the 
application may then call the system service and block until the next breakpoint. 

A breakpoint handler/ dispatcher helps control the invokation of applications and the like 
in response to messages passed back and forth between the host 260 and the splitter 200. This 
logic operates analogously to event engine techniques using registered callback routines. In this 
case, however, multiple applicaiions may effectively register for a notification. 

Exemplary Applicatio ns and Extended Functions 

Figure 3A illustrates a simple system 300 to illustrate certain basic applications and 

functions for which the system may be programmed using the principles of the invention. The 

system 300 includes intelligent splitter 200, a mainframe computer 310. two DASD control units 

320 ? 330. and several communications links. More specifically, an ESCON port 3 12 of 
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computer 310 is connected to Port A of intelligent splitter 200 via an ESCON communications 
link: an ESCON port 322 of control una 320 is connected to Port B of intelligent splitter 2;00 via 
another ESCON communications link; and an ESCON port 332 of control unit 330 is connected 
to Port C of intelligent splitter 200 via another ESCON communications link. 

One simple use -of intelligent splitter 200 in network 300 is to provide "mirroring," in 
which specified physical data on one control unit (e.g., a volume maintained by a primary control 
unit 320) is maintained as a concurrent copy on control unit 330. For this application, intelligent 
splitter 200 is appropriately configured to delect relevant I/O stream information. This 
configuration may he performed with the assistance of a host as described above, but it need not 
be and thus host 260 is excluded from this figure to illustrate this possibility. (For example, all 
of the necessary code may exist in a local memory of splitter 200.) Once so configured, the 
splitter 200 monitors I/O streams from mainframe host and/or from primary control unit 320. 
The splitter software may then monitor command frames from computer 310 and/or command 
response frames from control unit 320 to effectively track and detect when writes are to occur to 
a volume of interest. When a write to the targeted area is detected, the splitter may send that 
command frame to the control unit 320, unmodified, via Port B. The splitter 200 will create a 
modlled version of that command frame to reflect a different physical address on control unit 
330 that is dedicated to holding the mirrored data of interest and send the modified command on 
Port C. The splitter 200 may then monitor write command acknowledgment frames from the 
control units 320, 330 to ensure that each is capable of accepting the command, and to create or 
retransmit a write command acknowledgment frame to the mainframe 3 10. For example, in one . 
embodiment of this application, the splitter may simply analyze acknowledgment frames from 
both control units and relay ihe worst case -status to the mainframe. Assuming that the control 
units can accept the write data, the splitter will send data requests to the mainframe 310. 
accordingly and subsequently receive data frames in response. These data frames are then sent 
by the splitter to the control units unmodified. All frames coming from the control units, e.g., 
data request and status, are monitored by the splitter, which then creates corresponding messages 
therefrom to send to the mainframe 3 10 or which simply retransmits the worst case command 
received from the control units. For example, if the status information received from the control 
units indicate that both control units completed the write command successfully, then intelligent 
splitter 200 reports successful status information back to mainframe computer 310. However, if 
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the siams information received from the control units indicates that one or both of the control 
units did not complete the command successfully, then intelligent splitter 200 may report an 
unsuccessful status back to the mainframe computer 310. At this point, the mainframe computer - 
310 would normally invoke its standard error recovery procedures to handle the unsuccessful 
writes. As an alternative, if one or both of the control units reported an unsuccessful slams to 
intelligent splitter 200, intelligent splilier 200 could respond by requesting one or both of the 
control units to rc-cxecutc the write command. 

The above example illustrates two powerful operations performed by intelligent splitter 
200: (1) data replication and (2) header modification (or header manipulation). As stated above, 
every ESCON frame includes a header component and a data component. (This organization 
exists in other I/O protocols as well.) The header component identifies the sender of the frame as 
well as the intended recipient, or destination, of the frame (e.g., control unit and device number 
on the control unit). The data component includes the data that is being transmitted from the 
sender to the recipient of the frame. As an example, during the data transfer phase of a write 
command, the data component of a frame would include data that is being written from the 
sender (e.g., a mainframe computer) to the destination (e.g., a control unit). The combination of : 
header modification and data replication allows intelligent splitter 200 to send received frames to 
more destination devices (or other destination devices) than were originally addressed by the 
sender. In addition, from the perspective of mainframe computer 310, intelligent splatter 200 
may appear to be a control unit (i.e., computer 3 10 may communicate with intelligent splitter 200 
in exactly the same fashion in which computer 310. would communicate with a control unit). 

In some frames, the data component (also known as a payload) helps further define an 
eventual target of interest. For example, in a locate record command, the payload will specify a 
record address of eventual interest. By having visibility into ihis component and an ability to 
alter it, the splitter 200 may perform other functions such as extent relocation. This may be used 
in many ways. One possible way includes relocating critical sections from two devices to be ; 
contiguous on a third device. 

From the perspective of control units 320, 330, intelligent splitter 200 may appear to be a 
host computer (i.e., control units 320, 330 may communicate with intelligent splitter 200 in 
exactly the same fashion in which the control units 320, 330 would communicate with a host 
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computer). This feature of being able to communicate like a host computer or a control unit 
generally allows intelligent splitter 200 to be transparent to all devices to which it is connected. 

For purposes of the above example, only write commands need to be executed by both 
control units to maintain the integrity oi the mirrored data on control unit 330. AH other 
commands (e.g., read commands to the data of interest) received from mainframe computer 310 
may be executed only by primary control unit 320. In some situations, it may be desirable to 
send other commands to the control units (e.g., reads in the above case) so that the control state 
of the control units more accurately mirrors each other. 

Alternatively, the splitter may be programmed to switch over read commands to the Port 
having the mirroring control unit, if the primary control unit is no n- responsive. 

As a slight alternative to the above, control unit 330 may be a different type of control 
unit than primary unit 320 and may store data in a different formal. In such case, instead of 
having the splitter 200 send unmodified versions of the data frames received from mainframe 
3 1 0, the splitter 200 can reformat the data to the new format. , In the embodiments described 
above, the frames would be received from the mainframe into buffer 220, where they would be 
read and retransmitted to primary control unit 320. The processor 230 would read the same data 
and reformat it .as required and write the reformatted frame information into a buffer segment 
associated with the processor 230. Subsequently, the reformatted data may be transmitted to the 
mirroring control, unit 330. 

As another example of reformatting, the primary control unit 320 could store data to 
dusks and the other control unit 330 may have tape storage. The splitter 200 could be caused to 
read data from the control unit 320 via Port B ajid to write a reformatted, version thereof to Port 
C. In the embodiments described above, this could be implemented by the host 260 issuing 
service requests to the splitter to make the appropriate reads and writes. In addition, the host 
may help map logical to physical addresses and, in this regard, can cause the function (in this 
case "offline back-up") to be implemented at a logical level, e.g., a file. However, as suggested 
by Figure 3 A, the principles are generally applicable and do not necessarily need a host 260. 

This alternative illustrates an important point. Under the principles of. the invention, 
heterogeneous control units and devices may be included in a system and used in coordination to 
implement extended functions. In this example, two different types of control units were used 
for a form of extended function known as offline backup, but the principles are generally 
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applicable. to other functions. Once the system is configured (whether by use of session 
definitions or otherwise), the system performs the extended function without requiring any 
special commands to be sent over the link. This leads to another important point. Nameiy, 
under the principles ol the invention, extended functions may be implemented even though the 
control units only received basic, standard commands that are part of the ESCON vocabulary: 
No special commands needed to be sent to the control units. Thus, the control units do not need 
the extra complexity and sophistication to implement the extended functions, and instead control 
units that only provide the basic vocabulary are satisfactory. Thus, less costly control units may 
be used. Moreover, the control units did not need to dedicate any ESCON ports (often an 
expensive resource) to implement any proprietary inter-control unit links to implement the 
function. Thus, the extended functions do not detrimentally impact storage system connectivity. 

Alternatively, an application may be 'desired where ihc mirrored version of daia should be 
translated (e.g., EBCDIC to ASCII) or where certain data patterns should be filtered out (e.g., 
pattern "FFFP ). In such cases, the splitter 200 may be caused to perform the appropriate 
translation or filtering of the data frames. 

As a variation to the offline copying above (which off course need not involve disparate 
forms of control units or physical storage), the system may t>e operated so that a mainframe can 
access a control unit while intelligent splitter 200 ^performing off line copying or back-up. For 
example, intelligent splitter 200 may receive a read command from computer 310 while 
intelligent splitter 200 is in the process oi copying a segment of data from primary control unit 
320 to backup control unit 330. Intelligent splitter 200 may then stall the process oi copying data 
from primary control unit 320 to backup control unit 330 and service the command received 
from computer 310. After the command from computer 310 has been serviced, intelligent 
splitter 200 may then resume the operation of copying data from primary control unit 320 to 
backup control unit 330. In the above embodiments, the task execution component could 
prioritize the operations of handling online and offline transactions: The task execution 
component also has the capability to determine that one task has the necessary resources to 
perform its operation, and it can thus ensure that the bnLir.e task is scheduled at an appropriate 
instant. For example, if intelligent splitter 200 is in the process of copying a very large segment 
of data from primary control unit 320 to backup control unit 330 (Lc.,foy issuing a long sequence 
of read and write commands), intelligent splitter 200 could service the command received from 
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computer 310 immediately alter completion oi the current' Vcad-wriic cycle (i.e.. alter the write 
command is executed by backup control unit 330). Alternatively, intelligent splitter 200 could 
interrupt a read- write cycle to immediately service the command received from computer 310. 

Another function thai intelligent splitter 200 may perform while engaged in off iinc 
copying is intelligent execution of write commands received from computer 310. As an 
example, suppose that intelligent splitter 200 is engaged in copying all data from primary control 
unit 320 to backup control unit 330 (a volume level backup) when intelligent splitter 200 
receives a write command from computer 3 10. intelitgcnt splitter 200 could cause both control 
units to execute the write command. Alternatively, intelligent splitter 200 could analyze the 
address range affected by the write command and ( i ) cause both control units to execute the 
write command if the address range affected by the write command has already been copied by 
intelligent splitter 200 from primary control unit 320 to backup control unit 330 or (2) cause only 
primary control unit 320 to execute the write command if the address range affected by the write 
command has not yet been copied by intelligent splitter 200 from primary control unit 320 to 
backup control unit 330. 

As another example, intelligent splitter 200 may be programmed to move data from a 
first domain on a storage to another domain on the storage (e.g., to perform extent relocation ). 
In addition, the splitter may be programmed to redirect subsequent writes to the first domain to 
the new domain. 

Another useful application that may be performed by intelligent splitter 200 is "data How 
monitoring" and statistics gathering. Intelligent splitter 200 may be programmed to gather 
statistics during processing. Alternatively, for arrangements having a host 260, the splitter and 
host may be programmed to cooperate so that every time one of the intelligent splitter's Ports 
writes command frames into the buffer 220 the sputter notifies the host 260 accordingly. This 
notification is received by the host which dispatches to a corresponding statistics routine which 
causes the information from buffer 220 to be sent via bus 268 to host memory 264, where it may 
be analyzed to generate various statistics about usage. 

Another function that may be performed by the system is caching either at a physical or 
logical level, or in an application-specific manner. For example, the host may create session 
definitions in which write data is stored in host memory 264 only for very specific portions of 
data defined in physical or logical terms, e.g., a frequently accessed index file of a file system. 
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Subsequent commands may be intercepted, and tested for hits to this cache. Likewise, caching 
algorithms may be implemented that utilize gathered statistical information. . 

Other applications that may be implemented on the system include data replication, 
volume remapping, extent relocation, prefetcch algorithms, caching, data reformatting and/or 
translation, statistic packages, and security application (e.g., firewall-like, applications to 
storage), in addition, the system may be programmed to perform fail-over or switchover 
functions. For example, the system may be programmed to detect or determine when a control 
unit has gone offline and in response thereto automatically route commands to a backup control 
unit. Alternatively, in some cases, it may be preferable to automatically send commands to both 
control units (Lc., without a switchover condition previously being detected) and to switchover 
dynamically if the primary unit fails to respond. 

Figure 3B illustrates another illustrative system 350, In system 350, both of the 
computers 360, 364 can read data from, or write data to, control unit 370. In general, when Port 
A of intelligent splitter 200 receives a command from computer 360, the intelligent splitter 200 
will send a copy of that command via Port B to control unit 370: and the intelligent splitter 200 
will send a copy of any data received from control unit 370 via Port B in response to that 
command via Port A to computer 360. Similarly, in general,, when Port C of intelligent splitter 
200 receives a command from computer 364, the intelligent splitter 200 will send a copy of that 
command via Port B to control unit 370; and the intelligent splitter 200 will send a copy of any 
data received from control unit 370 via Port B in response to that command via Port C to 
computer 364. This arrangement allows two computers to share a control unit. 

To prevent the two computers from attempting to simultaneously use the same resource 
(i.e., the control unit 370) and potentially corrupt state or cause a deadlock, the splitter 200 can 
be programmed to control access to the critical resource (i.e., unit 370) using various forms of 
arbitration and scheduling techniques (e.g., using semaphores or analogous mechanisms). For 
example, if computer 360 transmits a command intended for control unit 370 to intelligent 
splitter 200 while control unit 370 is busy processing a command previously received from 
computer 364 to the same critical resource, intelligent splitter 200 may respond to the command 
received from computer 360 in at least three fashions. First, intelligent splitter 200 may simply 
report an unsuccessful status back to computer 360. Second, intelligent splitter 200 may wail 
until control unit 370 completes processing of the command previously sent by computer 364 



- 36 - 



O101257A2_l_> 



WO 01/01257 



PCTAJSOO/17760 



be lore sending the command received from computer 360 to conirol unii 370. Third, intelligent 
splitter 200 may interrupt the control unit 370, report an unsuccessful status to computer 364, 
and may immediately send the command received from compulcr 360 to control unit 370 for 
immediate processing. It will be appreciated that intelligent splitter 200 may arbitrate between 
the two computers 360, 364 in numerous other ways. 

Figure 6 shows an exemplary system 600 having muUiplc splitters operating 
cooperatively. In the illustrated arrangement, three intelligent splitters 610, 612, 614 arc arranged 
with a mainframe computer 620 and four DASD control units 630, 640, 650, 660. In system 
600, a port 622 of computer 620 is connected to Port A of intelligent splitter 610 via a 
communications link; Port B of intelligent splitter 610 is connected to Port C of intelligent 
splitter 612 via a communications link; Port C of intelligent splitter 610 is connected to Port A of 
intelligent splitter 614 via a communications link; Port A of intelligent splitter 612 is connected 
to a port 632 of control unit 630 via a communications link; Port B of intelligent splitter 612 is 
connected to a port 642 of control unit 640 via a communications link; Port B of intelligent 
splitter 614 is connected to a port 652 of control unit 650 via a communications link; and Port C 
of intelligent splitter 614 is connected to a port 662 of control unit 660 via a communications 
link. One function that may be performed on such an arrangement is n-way mirroring. Control 
unit 630 may be treated as a primary control unit and control units 640, 650, and 660 may all be 
treated as mirror copies of all or a portion of the data written to the primary control unit 630. 

Figure 7 shows a block diagram of an exemplary embodiment illustrating that multiple 
splitters may share a host 260. More specifically, system 700 includes eight intelligent splitters 
(200- i through 200-8) which each communicate to host 260 via bus 268, associated bridging 
components (represented by DMA engines 266), and associated software. In one embodiment, 
each intelligent splitter is implemented on a single card and all cards arc mounted in a personal 
computer chassis. System 700 may be referred to as a "data gate,*' Any network including one or 
more intelligent splitters may implement the intelligent splitters as stand alone devices or by 
using a device like the data gate which includes a plurality of intelligent splitters. For example, 
one or more of the intelligent splitters in the data gate may be used to implement the systems 
discussed above (e.g., systems 3(XX 500. and 600 as discussed in connection with Figures 3, 5, 
and 6). 
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To illustrate the llcxible connectivity possible. Figure 8 shows a system 800 in which 
multiple data gates are used. 

Referring back to Figure 2A. that system shows how an intelligent splitter 2(H) may be 
integrated with other fibre channel systems 290 and with open systems components 270. For 
example, the splitter 200 may send data to the host 260, which has been read from a control unit 
320. The host 260 may then cause that data to be transferred over a fibre channel FC to a file 
system 270. Likewise, the host 260 may cooperate with an open systems filer server 280 through 
agent technology at the server 280 and integrate the open systems server 280 into the overall 
system. For example, the host 260 may send commands via a TCP network link to server 280, 
asking the server 280 to open and allocate a file on server 280. Subsequently, the host may 
receive data from the splitter 200 which is then written to such a file. In addition, the host 260 
and server 280 may communicate metadata about the file system served by server 280, and this 
data may be used by the host to perform logical to physical mappings io implement logical level 
functions and applications. 

Clustering 

Another useful function that may be performed with intelligent splitters is clustering. 
With clustering, mainframe computers may cooperate to perform an application specific task in a 
more reliable way. For example, one mainframe may be responsible for performing a portion of 
a distributed task, and a second mainframe may be responsible for performing another portion of 
the task. If one of the mainframes fails, the other docs the work of both, preferably as seemlessly 
as possible. To do this, the clusieringapplications need to share various forms of data, including 
application-specific data and data to control the access of such data and to indicate the 
operational status of various components. (more below). 

Figure 9 shows one clustering arrangement. 900 under a preferred embodiment of the 
invention. Two intelligent splitters 200A, 200B arc arranged with two mainframe host 
computers 9 1 OA, 9 10B ( "mainframes!') and DASD control units 920A, 920B, In system 900, an 
ESCON port (not shown) of mainframe 9 10A is coupled to Port A of intelligent splitter 200A 
via an ESCON communications link; an ESCON port (not shown) of mainframe computer 910B 
is coupled to Port A of intelligent splitter 200B via an ESCON communications link; Port C of 
intelligent splitter 200A is coupled to an ESCON port 924A of control unit 920A via an ESCON 
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communicalions link; Port C of imclligcni spliltcr 2(K>E is coupled to an ESCON port 924B of 
control unii 920B via an ESCON communicalions link: and Pon B of intelligent splitter 2(K)A is 
coupled to Port B of intelligent splitter 200B via an ESCON communications link. 

For the clustering application, the intelligent splitters 200 A, 200 B are programmed to 
transmit ail transactions from their respective mainframes 910A, 910B to their respective control 
units 920A, 920B. A subset of the transactions is caused to be transmitted to the other intelligent 
splitter. The transmitted data corresponds to the data to be shared with the other processor. This 
includes the application-specific data needed to implement collaboration by the processors; for 
example the data thai a mainframe will need to access if it becomes the only processor to 
perform ihc clustered task, if the other mainframe goes offline or becomes inoperative. It also 
includes control data- to ensure that ihc application specific data is synchronized as necessary. It 
also includes status data, indicating for example whether a given control unit has gone offline. 
To transmit such data a private link connecting the two Port Bs is used in this embodiment. 

When the data is received by the other intelligent splitter, it may then be cached in a host 
memory 264 (not shown in Figure 9), where it may be accessed by the other mainframe. The 
arrangement allows data processing operations to be performed on the data to be transmitted to 
the other splitter, akin to the processing that may be specified in session definitions and service 
requests, and thus allows clustering in heterogeneous systems. In addition, the clustering may be 
implemented at a logical ievei, and allows the shared data to be transmitted more quickly, i.e., 
before it is written to storage in the primary unit. 

Figure 10 illustrates an alternate embodiment of clustering using a two port intelligent 
splitter integrated into control units 1020 A, J020B: The two port splitter operates analogously 
the three port device described above, except it has two pons. Private link 1060 is used to carry 
the information between splitters, in the clustering context, the two port device may 
communicate with the control unit via a bus 268 like that described above, e.g., PCI. To-be- 
shared information is then cached in the main memory 930 of the control units. Alternatively, in 
Figure 10, the control 932 may serve the function of host 262 described above, and memory 930 
may serve the function of host memory 264. 

In the exemplary embodiments above, the private link may be used to carry the 
information to be shared as specified in a session definition, for example. This definition would 
specify the I/O streams of interest passing between respective mainframes and control units. It 
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would also include any necessary synchronization information to ensure thai when each 
mainframe or control unit accesses data thai that data is coherent. Ii would also include status 
inlormation about the respective components. For example, the splitters may he programmed to . 
delect when a respective control unit has gone offline... This may be the result of an error 
condition or because of an administrative action. In such case, the I/O requests will need to be 
serviced by the control unit still online. For example, it control unit 920 A goes offline, 
mainframes 910 A and 910 B would need their I/O requests handlpd by, control unit 920 B. 
Splitter 200 A would need to then send any I/O from mainframe 910 A to ihe splitter in front of 
control unit 920 B to be subsequently handled by 920 B. (This situation is analogous to the 
arrangement of Figure 3B; and that discussion is relevant here as well.). 

More specifically, with reference to the embodiment of Figure 9, the intelligent splitters 
200 may be appropriately configured to detect relevant I/O stream information (information 
relevant to the clustering). This configuration may be performed with the assistance of a host 
260 (not shown in these figures) as described above. pnce { so configured, the splitter 200 
monitors I/O streams from mainframe hosts (either. directly from a port connected to a 
mainframe or indirectly from a private link connected to another splitter). The splitter also, 
monitors I/O streams from respective control units, e.g., 920 A (either directly from a port 
connected to a control unit or indirectly from- the private link connected to another splitter). 

The splitter may then monitor command frames from the mainframes and/or command 
response frames from the control units to effectively track and detect when writes or reads are to 
occur to ihe data of interest and to act accordingly. For example, when a write to a targeted area 
is detected, the splitter 200 A may send that command frame to the attached control unit, e.g., 
920 A, unmodified, via Port C. The splitter 200 will then convey that information on a private 
link to the other splitter, e.g., 200 B. The sending splitter 200 A may create a modified version 
of that command frame to reflect a different physical address on control unit 920B, or this 
modification may be delegated to the splitter 200B. The splitter 200 A may then monitor write 
command acknowledgment frames from the control units to ensure that each is capable of 
accepting the command, and to create or retransmit a write command acknowledgment frame to 
the mainframe that sent the write. For example, the splitter may simply analyze 
acknowledgment frames from both control units and relay the worst case status to the 
mainframe. Assuming that the control units can accept the write data, the splitter will send data 
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requests to ihc mainframe accordingly and subsequently, receive data frames in response. These 
data frames are then sent by the splitter to the control units unmodified (again either directly or 
indirectly) and can be cached at the respective splitters as well. All frames coming from the 
control units, e.g., data request and status, arc monitored by the splitter, which then creates 
corresponding messages therefrom to send to the mainframe or which simply retransmits the 
worst case command received from the control units. For example, if the status information 
received from the control units indicate that both control units completed the write command 
successfully, then intelligent splitter 200 reports successful status information back to mainframe 
computer. However, if the status information received from the control units indicates that one 
or both of the control units did not complete the command successfully, then intelligent splitter 
2(X) may report an unsuccessful status back to ihc mainframe computer. At this point, the 
mainframe computer would normally invoke its standard recovery procedures to handle the 
unsuccessful writes. As an alternative, if one or both of the control units reported an unsuccessful 
status to intelligent splitter 200, intelligent splitter 200 could respond by requesting one or both 
of the control units to re-execute the write command. 

Reads may operate similarly. However, unlike writes, reads need not necessarily be 
"cluster mirro red/' 

In addition, the logic to handle switch-overs may be implemented on the main frame-side 
or handled by the intelligent splitter (and/or corresponding host 260). For example, by 
conveying information that a control unit has failed back to the mainframe, the mainframe may 
then take action by sending I/O streams with address information identifying that the requests 
should be handled by the other control unit. Alternatively, this form of re-mapping may be 
handled by the software on the splitter-side. 

It will be appreciated that the front end 1050 of the control units 1020 is very similar to 
the intelligent splitter 200. However, whereas the intelligent spikier includes three 
communication pons, front end 1050 includes only two communication ports. Also, whereas the 
internal system associated with intelligent splitter 200 is normally a PC environment (e.g., as 
shown in Figures 2D and 7), front end 1050 is built into and forms an integral part of a control 
unit. Figure 10 does not show a processor and processor memory as being a pan of front end 
1050. However, it will be appreciated that front end 1050 docs preferably include a processor 
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memory and a processor ( 1 ) that can access butter 1056 via the system bus 936 and (2) thai can 
access a processor memory. 

Qihcr Variations 

Various preferred embodiments were described above. Numcrous variaiions of the above 
arc foreseeable. For example, though many of the embodiments arc with reference to a splitter 
cooperating with a separate host, ihc system may operate with one processor only either in the 
splitter or as a host. In addition, the general principles described above may be implemented as 
special purpose systems, lor example, thai only implement mirroring, instead of generalized 
platforms as described above. In the case of special purpose systems, the control may be 
implcmentablc with control logic as opposed to programmatic control. In addition, many ot the 
novelties realized in software (e.g., extended functions at a logical level) arc not necessary for 
the preferred form of splitter and vice versa. r For example, one of the many advantages of the 
intelligent splitter is that it has a common buffer and thus reduces the need for copying data. 
This novel architecture can stand on its own. Similarly, novel. systems may be realized with 
splitters that do not use a common buffer. For example,, a splitter that requires copying of data 
may be inferior to the preferred splitter described above, yet still be novel over known prior an. 
Likewise, though repeated reference was made to ESCO^ embodiments, the principles are 
applicable to I/O streams in general and may be adapted to SCSI or fibre channel implementation 
among others. In addition, special situations maybenettt from arrangements in which a splitter 
includes ports for communicating according to multiple protocols, e.g., one port being FC and 
another ESCON. 

Since certain changes may be made in the above apparatus without departing from the 
scope of the invention herein involved, it is intended that all matter contained in the above 
description or shown in the accompanying drawing shall be interpreted in an ..illustrative and not 
a limiting sense. 

What is claimed is: 
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1. An intelligent splitter device, comprising: 
a buffer memory; 

at least three input/output (I/O) ports, each port having logic lor transmitting and 
receiving information on an associated input/output (I/O) communication link, and each port in 
electrical communication, with the butler memory; and 

control logic lo control each port to write I/O data received from its respective I/O 
communication link into a portion of the buffer memory associated with that port, and to transmit 
data onto its respective I/O communication link by reading data from the buffer memory at 
selectable buffer memory locations, including locations associated with one of the other ports. 

2. The intelligent splitter of claim I wherein the protocol is ESCON. 

3. An intelligent splitter device lor communicating according to a frame-based I/O protocol 
in which each frame includes a header component and a payload component, comprising: 

a butler memory; 

at least three input/output (I/O) ports, each port having logic for transmitting and 
receiving information on an associated input/output (I/O) communication link, and each port in 
electrical communication with the buffer memory; and 

control logic to control each port to write I/O data received from its respective I/O 
communication link into a portion of the bulfer memory, the control logic further including logic 
to analyze a received frame and in response thereto create and transmit a modified version of the 
received frame on one of the I/O ports, wherein the modified version includes a modification to 
at least one of the header component and the payload component of the received frame. 

4. The intelligent splitter device of claim 3 wherein the control logic includes further logic 
for transmitting the received frame on another of the I/O ports in parallel with the transmission of 
the modified version of the received frame. 
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5. The intelligent splitter ol claim 4 wherein the control logic includes logic to modify the 
header component to contain a destination address that is different than a destination address in 
the received frame. 

6. The intelligent splitter ol claim 4 wherein the control logic includes a local processor and 
processor memory, wherein the processor memory includes processor-executable instructions 
responsive to the ports and capable of changing state in the pons to control the ports. 

7. The intelligent splitter of claim 6 wherein the processor memory includes data associating 
physical address information with corresponding defined splitter actions, and wherein the 
processor-executable instructions include instructions that 

read a portion of a buffer that contains frame information including physical address 
information; 

compare the physical address information read from the buffer with the data in the 

processor memory; and 

implement the corresponding splitter actions based on results from the comparison of the 
physical address information read from the buffer with the data in the processor memory. 

8. The intelligent splitter of claim 7 wherein the splitter actions define a set of processor- 
executable instructions to create and transmit a modified version of a received frame on one of 
the I/O ports, wherein the modified version includes a modification to at least one of the header 
component and the payload component of the received frame. 

9. An intelligent splitter device forcommunicating according to a multiphase I/O protocol 
having a command phase, a data phase, and a status phase, and wherein the data phase comprises 
at least one data frame, the spiiticr comprising: 

a buffer memory; 

at least three input/output (I/O) ports, each port having logic for transmitting and 
receiving information on an associated input/output (I/O) communication link, and each port in 
electrical communication with the buffer memory; and 
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control logic to cause at least two of the pons to transmit in parallel data phase 
information. . . . ...... 

10. An intelligent splitter device for communicating according to a multiphase I/O protocol 
having a command phase, a data phase, and a status phase, and wherein the data phase comprises 
at least one data frame, the splitter comprising: 

a buffer memory; 

at least three input/output (I/O) pons, each port having logic for transmitting and 
receiving information on an associated inputyoui.put (I/O) communication link, and each port in 
electrical communication with the buffer memory; and 

control logic to cause at least one port to transmit control phase information stored in the 
buffer, and in parallel therewith to cause at least one other pon to transmit a modified version ol 
the control phase information transmitted on the at least one port. 

11. An intelligent splitter device for communicating according to a multiphase I/O protocol 
having a command phase, a data phase, and a status phase, and wherein the data phase comprises 
at least one data frame, the splitter comprising: 

a buffer memory; ...... 

at least three input/output (I/O) ports, each pon having logic for transmitting and 
receiving information on an associated input/output (I/O) communication link, and each port in 
electrical communication with the buffer memory; and 

control logic to receive status phase information from at least two of the input/output 
ports and to send status phase information on a third of the three pons in response thereto. 

12. A method of communicating I/O information organized according to a multiphase 
protocol, having a command phase, a data pha.se, and a status phase, within an I/O system having 
a first device, a second device, and a third device in electrical communication with a 
communication device having at least three pons, the method comprising the acts of: 

receiving I/O information on a first pon of the communication device having at least 
three pons; ; , 



-45- 



BNSDOCID: <WO 0101257A2_I_> 



WO 01/91.257 



PCT/US00/17760 



the communication device having at least three pons transmitting the I/O information 
received on the first port to a second port of the communication device having at least three 
ports; and 

the communication device having at least three ports transmitting a modified version of 
the I/O information received on the first port to a third port of the communication device having 
at least three ports. 

13. A method of communicating I/O information organized according lo a multiphase 
protocol, having a command phase, a data phase, and a status phase, within ah I/O system having 
a first device, a second device, and a third device in electrical communication with a 
communication device having at least three ports; the method comprising the acts of: 

receiving I/O information on a first port of the communication device having at least 
three pons; and ; * 

the communication device having at least three ports transmitting a modified version of 
the information received on the i'irsi port to one of thfe second and third ports of the 
communication device having at least three ports. . . 

14. A method of communicating I/O information organized according to a multiphase 
protocol, having a command phase, a data phase, and a status phase, within an I/O system having 
a first device, a second device, and a third device in electrical communication with a 
communication device having at least three ports; the method comprising the acts of: ' 

caching at least selected I/O information in the communication device having at least 
three ports; " 

receiving I/O information on a first port of the communication device having at least 
three ports; : 

analyzing the received I/O information to determine physical address data of the I/O 
information; 

transmitting cached I/O information on the first port if the received information hits the 

cache; 

transmitting the received information on one of the second and third ports if the received 
information misses the cache. 

-46- 



BNSDOCIO: <WO 0101257A2_I_> 



WO 01/01257 



PCT/US00/17760 



15. A method ol communicating I/O information between three devices in which the I/O 
information is organized according to a multiphase protocol, having a command phase, a data 
phase, and a status phase, and in which the inlormaiion is organized as frames each having a 
header component and a payload component, the method comprising the acts ol: 

receiving I/O inlormaiion on a first port ol a communication device having at least ihrec 
pons and storing a header component ol a frame in a memory of the communication device 
having at least three pons and storing at least a portion of a payload component in the memory: 
producing a header component different than the received header component: 
transmitting I/O information on at least one ol the two other ports, using the produced 
header component in at least one frame of the transmitted I/O information. 

16. A method of communicating I/O information between three devices in which the I/O 
information is organized according to a multiphase protocol, having a command phase, a data 
phase, and a status phase, the method comprising the acts ol": 

receiving command phase information on a first port ol a communication device having 
at least three pons; 

transmitting the command phase information received by the communication device 

having at least three pons on a second port of the communication device having at least three 

P° re- 
transmitting a modified version of the command phase information received by the 

communication device having at least three pons on a third port of the communication device 

having at least three pons; 

receiving data phase information on the first port; 

transmitting data phase information on the second and third pons in response to receiving 
data phase information on the first port. 

17. An intelligent splitter system, comprising 
an I/O device having 

input/output (I/O) port logic for transmitting and receiving information on 
at least three I/O communication links according .to a standardized I/O protocol in 
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which I/O operations may be requested on storage locations identified by physical 

address information; and 

extended function logic, cooperating with the port logic, 10 perform at 

least one extended function not specified in the standardized I/O protocol, 

wherein the at least one extended function operates in a physical address domain 

of physical addresses; and 
a host system having a processor and memory storing processor-executable instructions 
to map logical storage object names to a physical address domain and storing instructions 
to configure the I/O device with a mapped physical address domain to enable the I/O 
device to perform the at least one extended function oh the physical address domain 
corresponding to a mapped logical storage object name. 

18. The intelligent splitter system of claim 17 wherein the host includes instructions to 
configure the I/O device with a session definition that defines a physical address domain and a 
corresponding set of actions for the I/O to perform for I/O operations in the defined physical 
address domain, and wherein the I/O device includes logic for configuring the extended function 
logic with the session definition. 

19. The intelligent splitter system of claim 17 wherein the I/O device includes service request 
logic to create and send I/O information on a specified communication link connected to the I/O 
device, and wherein the host includes instructions to communicate a service request to the I/O 
device, specifying an I/O operation and a physical address to invoke the service request logic of 
the I/O device. 

20. The intelligent splitter system of claim I ? 

wherein the I/O device includes a processor and memory having processor-executable 
instructions to implement the extended function logic and processor-executable instructions 
defining a first application program interface (API) specifying functionality provided by the I/O 
device, 

wherein the host system memory further includes processor-executable instructions 
defining a second API, specifying functionality provided by the host system, and 
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wherein the second API includes processor-executable instructions in invoke the I'irsl 

API 

21. The intelligent splitter system oi claim 17 further comprising a network connection, and 
wherein the host memory includes processor-executable instructions to transmit requests over the 
network connection to obtain information describing storage at a target communication node of a 
network on which the network connection is connected, and wherein the host memory further 
includes processor-executable instructions to transmit information received in reply to such 
requests to the instructions that map logical storage object names to a physical address domain. 

22. The intelligent splitter system of claim 17 further comprising I/O logic to transmit and 
receive information according to another standardized I/O protocol, and wherein the host 
memory further includes processor-executable instructions to transmit and receive information to 
the I/O logic. 

23. A method or managing data wichin an I/O system having a first device, a second device, 
and a third device, wherein each device is capable of transmitting and receiving L/O information 
organized according to a standardized I/O protocol, wherein the standardized I/O protocol has an 
associated set of I/O functions, the method comprising the acts of: 

receiving a command to perform an extended function on data identified by a logical 
object name; 

mapping the logical object name to a physical address domain; . 

monitoring a communication link in the I/O system to determine if the iink is carrying an 
I/O operation in the mapped physical address domain: , 

if an I/O operation is being carried on the link within the mapped physical address 
domain, intercepting that operation and performing the extended function identified in the 
received command. 

24. A method of managing data within an I/O system having a first device, a -second device, 
and a third device, wherein each device can transmit and receive I/O information organized 
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according lo a standardized I/O protocol having an associated set of I/O functions-defined in the 
protocol, the method comprising the acts of: 

receiving I/O information from the first, second, and third devices into a splitter system 
capable of and transmitting and receiving I/O information among the first; second, and third 
devices; 

causing the splitter device to generate I/O information, independently of received I/O 
information; 

causing the splitter to transmit the generated I/O information to at least one of the first, 
second, and third devices. 

25. A method of managing data within an I/O system having a first device, a second device, 
and a third device, wherein each device is capable of transmitting and receiving I/O information 
organized according to a standardized I/O protocol, wherein the standardized I/O protocol is 
frame-based with each frame having a header component and payload component, the method 
comprising the acts of: 

receiving I/O information into a communication device having at least three I/O 
communication links, in which the I/O. communication device includes a predefined interface for 
manipulating at least one of the header and payload components of the I/O information received 
by the communication device; ^ ? 

a host sending commands to the predefined interface to control the communication 

device. 

26. A method of clustering two computing systems in which each computing system includes 
an I/O communication link, comprising: 

receiving I/O information from one of the twocommunication links into a first multi-port 
device having a processor and memory wiih instructions to determine if I/O information received 
on the link is within a first predefined set of I/O operations, the first multi-port device being 
connected to a first storage system; 

receiving I/O information from a second of the two communication links into a second 
multi-port device having a processor and. memory with instructions to determine if I/O 
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information received on ihc link is within a second predefined set of I/O operations, the second 
multi-port device being -.connected to a second storage. system;, j 

sending the I/O informal ton received on. the one communication link to the first storage 

system; 

sending the I/O information received on the second communication link to the second 
storage system; „ 

determining if . the received I/O information on the one communication link is within the 
first predefined set of I/O operations and, if so, sending an I/O operation indicative of the 
received I/O information to f a port oi; the second multi-port device; 

determining if the received I/O iriformatipn An the, second communication link is within 
the second predefined set of 1/0 operation^ anq\. if so, sending an I/O operation indicative of the 
received I/O information to r a port of the first multi-port dpvicc. 

27. The method of claim 26 wherein the first multi-port device caches information received 
trom the second multi-port device and wherein a computer of a first of the two computing 
systems may read information from the cache. 

28. The method of claim 26 wherein a first of the two computing systems transmits I/O 
information to the first storage system according to a first organization, and wherein the second 
of the two computing systems transmits I/O information to the second storage system according 
to a second organization, and wherein if the first multi-port device transmits an I/O operation to a 
port of the second multi-port device, the first multi-port device sends the operation according to 
the second organization. 

29. The method of claim 1 wherein the I/O information is organized according to a 
multiphase I/O protocol having a command phase, a data phase, and a status phase, and wherein 
the method further comprises 

the first multi-port device receives status phase information from the first storage system 
and receives corresponding status phase information from the second multi-port device; and 

the first multi-port device transmits status phase information to the first of the two 
computing systems. 
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30 The method of claim 29 wherein first multi-port dcv.ee dctcrm.ncs a worst ease status of : 
the received status phase information and sends status phase mformation- indicative ot thc worst 
case to the first computing system. 

31. An I/O clustering apparatus, comprising 

a storage control un.t lor receiving and transmitting information according to a 

predefined I/O protocol: 

a multi-port splitter device having a first port for receiving and transmitting I/O 
mformation according to the predefined protocol, a second port for communicating with another 
splitter dcv!cc. and a communication path connected to the storage control unit; wherein the 
multi-port splitter includes logic for determining if I/O information received on the first port is 
within a predefined set of operations .1 so sending indicative information therefor to the second 
port. 
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